A Rejection Method for the Isolated Word Recognition System

نویسندگان

Dong-hwa Kim

Young-Ho Kim

چکیده

M efficient rejection method is implemented for the HMM based small vocabulary isolated word recognition system. Six clustered phoneme models are generated using statistical method from the 45 context independent Korean phoneme models which were trained using the phonetically balanced Korean speech database and the classification through likelihood ratio scoring is performed based on the clustered models. me performance test for speaker independent isolated words recognition task on the 22 section names shows that our method is superior to the classification based on the likelihood scores of the first and the second candidates. 1. ~TRODUC~ON The small vwabulary isolated word recognition system as well as large vocabulary continuous speech recognition system can be applicable in many areas. In practice, users of isolated word recognition system tend to speak unregistered vocabulary words owing to carelessness or ignorance. For the isolated word recognition system to be practical, the ability to reject unregistered vocabulary is necessarily required. filler models has been used commonly in the HMM based ke~ord spotting systems to represent non-kepords [1]. This method often leads to explicitly train the filler models with extraneous speech and non speech database [2]. An alternative technique uses the difference in log-likelihood of the two highest ranking keywords [3]. Although this technique reduces the ke~ord rejection error rate, false alarm ra(e increase to a large amount [4]. This paper describes an efftcient rejection method in the speaker independent isolated word recognition system using clustered phoneme models similar to filler models in the keyword spotting system and the results of assessing the rejection capabilities. 2. CLUSTEWNG AND REJEC~ON ALGOWTHM The basic idea of our rejection method lies in the using models with smoothing effects i.e., the models that can represent both registered and unregistered vocabulary to a certain extent. We use 6 clustered phoneme models derived from the 46 context independent phoneme models that were trained using the phonetically balanced Korean speech database. Monophone clustering algorithm[5] is used to generate the 6 clustered models. The distance measure of this algorithm is as follow: D(Pi, Pj) = ~ D (Pi, Pj), d=~ d (1) where Pi, P, are the i* and jh phonemes and N is the number of states in a phoneme model. D~(P,,P,) is the distance between each states of two phonemes, defined as v (midk mj& )* Dd(Pi, Pj) =1 z ‘k=[ ‘idk 9 ‘jdk (2) where V is the dimension of observation vectors, m,~ and S* are the mean and standard deviation of the dm state of the i“ phoneme model. The phoneme models are clustered using the K-means algorithm and the distance measure defined in (1). With the information of clustering, six phoneme models are generated through retraining. The rejection method that has been used commonly is to reject the out of vocabulary according to the difference of Viterbi scores between the first and second candidates. Our method uses the scores of clustered phoneme models and that of whole word mdels simultaneously and the decision of rejection is performed by applying a threshold,

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rejection measures for handwriting sentence recognition

In this paper we study the use of confidence measures for an on-line handwriting recognizer. We investigate various confidence measures and their integration in an isolated word recognition system as well as in a sentence recognition system. In isolated word recognition tasks, the rejection mechanism is designed in order to reject the outputs of the recognizer that are possibly wrong, which is ...

متن کامل

Holistic Farsi handwritten word recognition using gradient features

In this paper we address the issue of recognizing Farsi handwritten words. Two types of gradient features are extracted from a sliding vertical stripe which sweeps across a word image. These are directional and intensity gradient features. The feature vector extracted from each stripe is then coded using the Self Organizing Map (SOM). In this method each word is modeled using the discrete Hidde...

متن کامل

Comments on 'An improved endpoint detector for isolated word recognition'

Accurate location of the endpoints of an isolated word is important for reliable and robust word recognition. The endpoint detection problem is nontrivial for nonstationary backgrounds where artifacts (i.e., nonspeech events) may be introduced by the speaker, the recording environment, and the transmission system. Several techniques for the detection of the endpoints of isolated words recorded ...

متن کامل

Fuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition

In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...

متن کامل

تشخیص دست‌نوشتۀ‌ برخط فارسی با استفاده از مدل زبانی و کاهش قوانین نگارش کاربر

The Joint-up, cursive form of Persian words and immense variety of its scripts, also different figures of Persian letters depending on their sitting positions in the words, have turned the Persian handwritings recognition to an intense challenge. The major obstacle of the most often recognition ways, is their inattention to sentence contexture which causes utilizing of a word with correct appea...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

A Rejection Method for the Isolated Word Recognition System

نویسندگان

چکیده

منابع مشابه

Rejection measures for handwriting sentence recognition

Holistic Farsi handwritten word recognition using gradient features

Comments on 'An improved endpoint detector for isolated word recognition'

Fuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition

تشخیص دست‌نوشتۀ‌ برخط فارسی با استفاده از مدل زبانی و کاهش قوانین نگارش کاربر

عنوان ژورنال:

اشتراک گذاری